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Abstract 

These notes are intended as an introduction to noncommutative (quantum) filtering the- 
ory. An introduction to quantum probability theory is given, focusing on the spectral theorem 
and the conditional expectation as the least squares estimate, and culminating in the con- 
struction of Wiener and Poisson processes on the Fock space. Next we describe the Hudson- 
Parthasarathy quantum Ito calculus and its use in the modelling of physical systems. Finally, 
we use a reference probability method to obtain quantum filtering equations, in the Belavkin- 
Zakai (unnormalized) form, for several system-observation models from quantum optics. The 
normalized (Belavkin-Kushner-Stratonovich) form is obtained through a noncommutative ana- 
logue of the Kallianpur-Striebel formula. 



1 Introduction 



The development of a theory of feedback control in the 1950s and 60s has provided a huge sti- 
mulus to the engineering disciplines. Nowadays all but the simplest of devices that make up our 
everyday lives use feedback to provide efficient and reliable performance despite the ever increas- 
ing complexity and miniaturization. However, at a time when microtechnology is making way for 
nanotechnology, we are rapidly approaching the boundary of the classical world past which the 
effects of quantum mechanics cannot be neglected. The theory of quantum mechanics tells us that 
any description of the phenomena at small scales is inherently nondeterministic in nature. This 
opens new areas of application for stochastic control theory, which will likely play an important 
role in a future generation of technology. In particular, as observations of quantum systems are 
inherently noisy, the theory of filtering — the extraction of information from a noisy signal — forms 
an essential part of any quantum feedback control strategy. 

These notes are intended as an introduction to quantum filtering theory. We begin by giving a 
brief overview of noncommutative probability theory 0^ , the associated stochastic calculus 
[55] , and the role of this theory in the modelling of quantum dynamical systems [21 IS E^j . We 
then obtain quantum filtering equations T3' for a variety of quantum models. We systematically 
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use change of measure techniques, bringing our approach close to the classical reference probability 
method of Zakai ■ 

The study of quantum filtering and control was pioneered by V.P. Belavkin in a remarkable series 
of articles, dating back to the early 1980s [3 HOI CH [12 CSl CH . A discrete version of his filtering 
equation can already be found in |Hj . After the development of quantum stochastic calculus ^35j , 
continuous time versions of Belavkin's equation appeared in JI] , and JHI ■ The sample paths 
of the solution of the quantum filtering equation |21l 1221 1191 IKK] are known in the continuous time 
quantum measurement community as quantum trajectories. 

The reference probability method used here is inspired by the classical treatment of Zakai j56| . In 
this approach we change the underlying measure so that the filtering problem reduces to elementary 
manipulations of the conditional expectation with respect to the new (reference) measure. The 
key idea that allows us to apply this approach to the quantum case is the observation that the 
Radon-Nikodym derivative must be compatible with the observations that we are conditioning 
on. To obtain a change of measure that satisfies this property we employ a technique that to our 
knowledge first appeared in a paper by Holevo [HII (see also |12l 1291 IH]). which replaces here the 
Girsanov tranformation in Zakai's treatment. Apart from this point, the derivation is essentially 
classical, i.e. we derive the Belavkin-Zakai equation and obtain the (nonlinear) Belavkin-Kushner- 
Stratonovich equation via a noncommutative analogue of the Kallianpur-Striebel formula. 

In contrast to the martingale techniques used in [131 118| . the reference probability method is 
independent from the innovations conjecture (as is the case in classical filtering). Moreover, 
the reference probability approach consists only of elementary manipulations of the (quantum) 
conditional expectation. As in classical filtering theory the Kallianpur-Striebel formula has found 
a wide range of applications, it is our hope that a similar approach will be fruitful in the quantum 
context and will clarify some of the existing literature on quantum filtering theory. 

After the filtering equations have been obtained, methods from classical nonlinear stochastic con- 
trol can be applied |10l I52L [T7| to design control laws. Thus the presence of quantum mechanics 
in quantum feedback control remains limited to the filtering procedure. Recent experiments im- 
plementing quantum feedback controls ^ 123 Ell have led to renewed interest in the field which is 
now rapidly expanding gmSSiniEElEZIESlESlEIlESllIHI- The interaction between the areas of 
stochastic control and theoretical and experimental physics is essential in paving the way towards 
the engineering of quantum technologies. 

There are three key ingredients that are required for the development of quantum filtering theory. 
First, we need to capture both classical probability and quantum mechanics within the framework 
of a generalised probability theory, called noncommutative or quantum probability theory. The 
central object in this theory, the spectral theorem, allows us to make a seamless connection between 
quantum systems and the associated classical probabilistic measurement outcomes. Second, we 
need a noncommutative generalization of the concept of conditional expectations. As in classical 
probability, we will find that a suitably restricted definition of the quantum conditional expectation 
is none other than a least squares estimator, which elucidates its role in quantum filtering theory. 
Finally, we need a noncommutative analogue of a stochastic calculus and of stochastic differential 
equations. This construction provides a broad class of models for which we can obtain quantum 
filtering equations. 
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In the following we develop each of these themes in turn. Section [21 introduces noncommutative 
probability theory, focusing on the spectral theorem, and introduces the quantum conditional 
expectation as a least squares estimate. In section |31 we show how Wiener and Poisson processes 
emerge in a particular quantum probabilistic model based on the Fock space. The construction 
additionally demonstrates the use of the spectral theorem. Section ^ is an exposition of the 
noncommutative stochastic calculus of Hudson and Parthasarathy. In Section El we introduce the 
system-observation models which we are interested in; they are obtained from physical models via 
a weak coupling limit. Section deals with the derivation of the Belavkin-Zakai and the Belavkin- 
Kushner-Stratonovich equation using the reference probability approach. We conclude these notes 
with some additional examples. 



2 Noncommutative probability theory 



In this section we wish to make the point that quantum mechanics is a probability theory in which 
the random variables, called observables in quantum mechanics, are allowed to be noncommutative. 
Indeed, in quantum theory observables are represented by self-adjoint operators on some Hilbert 
space which in general need not commute. In order to incorporate quantum mechanics in the 
framework of probability theory we will weaken the axioms of the latter to allow for noncommuta- 
tive random variables. By considering commuting observables only, classical probability theory can 
be recovered using the spectral theorem. The latter asserts that commuting self-adjoint operators 
can be diagonalised simultaneously, allowing them to be interpreted simultaneously as functions 
(i.e. random variables) on the diagonal (the spectrum). Excellent introductions to noncommutative 
probability theory are [451 ITS] , see also _16, _52^ . 

As quantum probability is algebraic in nature, it is instructive to begin by forming an algebraic 
picture of classical probability theory. Consider a classical probability space (r2,I],P). Then 
the space A = i°°(rj,S,P) of bounded measurable complex functions on H. (we will always use 
complex function spaces L°°, etc.) is a *-algebra: any complex linear combination, pointwise 
multiplication, and conjugate of functions in A are still in A. We can use integration with respect 
to P to map any element of ^ to a complex number. If we operate this map P : ^ ^ C on 
an indicator function, we obtain the probability of the corresponding event. As the algebra only 
contains measurable functions, and as it is in fact generated by the indicator functions (any element 
in L°° can be approximated arbitrarily well by linear combinations of its indicator functions), the 
pair (-4, P) encodes the same information as the probability space (r2,E,P) (at least up to sets 
of measure zero |45| . a point on which we will not dwell further.) It is convenient to represent 
the elements of the algebra A = L°°{^1, S, P) as operators on the Hilbert space — L^{^, S, P). 
This can be done by letting A act on via pointwise multiplication (note that A-tp is an element 
of ,if for a.nyAGA,ipe Jif). 

Let us now generalize these ideas. Denote by B{J^) the algebra of all bounded operators on a 
Hilbert space , and let S C 6(Jf ). The set S' := {X e e(^); XS = SX, VS* G S} is called 
the commutant of S in B{Jif). A von Neumann algebra A on is a *-subalgebra of B{J^) such 
that A equals its double commutant, i.e. A" = A. Von Neumann's double commutant theorem 
then asserts that A is closed in the weak operator topology, i.e. if for some net {Aj} G A the inner 
products {x, Ajy) {x, Ay) for a\lx,y S , then A is an element of A. This property guarantees 
that a von Neumann algebra is generated by its projections P E A, P = = P* |38| . 
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From A' = ^ it immediately follows that the identity / is an element of the von Neumann algebra 
A. A state on ^ is a linear map P : ^ C such that P is positive in the sense that P(A*A) > 
for all A € and such that P is normalised, i.e. P(/) = 1. A state is called normal if it is weak 
operator continuous on the unit ball of A. It is called faithful if f{A*A) — implies ^4 = 0. We 
remark that S, P) is a von Neumann algebra on S, P) with normal state P. 

The following theorem (see P3 sections 9.3-9.5 for a proof) is at the heart of spectral theory. 

Theorem 2.1: (Spectral Theorem) Let C be a commutative von Neumann algebra and P a 
normal state on C. Then there is a probability space (r2,I],P) such that C is *-isomorphic to 
L°°(f7, S, P), the space of all bounded measurable functions on ft. Furthermore, if we denote the 
* -isomorphism by l : C — > L°°{fl, E, P) then we have 

P(C) = / L{C)P{dLu), C eC. 
Jn 



Example. We are guided by the elementary case of a Hilbert space with dimension n < oo. Any 
linear operator on such a space can be represented as a (complex) n x n matrix, and any *-algebra 
of matrices is a von Neumann algebra. Now consider a set of matrices {A^} that commute with 
each other and with all their adjoints, = — V«,j. These matrices generate 

a commutative von Neumann (matrix) algebra C. We can now simultaneously diagonalize every 
element of C using a unitary U. To each A € C, we associate a map l{A) : {1, . . . ,n} ^ C such that 
L{A){i) is the ith diagonal element of the diagonalized matrix U*AU. Then l{A) £ L°°{{1, . . . , n}), 
and l{AB) = l{A)l{B) under pointwise multiplication in L°°. Now choose S = (t(l{C); C G C), 
the sigma-algebra generated by l{C). Finally, given a state P on C, define for F G E the measure 
P(F) = ¥{UC^U*) where Cf^ = xr{i) (xr is the indicator function of F). Then we have explicitly 
constructed a *-isomorphism i : C — > L°°{{1, . . . , n}, S, P). The spectral theorem is an extension of 
these ideas to infinite dimensional operators. Though technically much more involved, the flavour 
of the procedure remains the same. 

We have already described how a classical probability space can be encoded algebraically by a com- 
mutative von Neumann algebra with normal state. The spectral theorem allows us to conclude 
that studying commutative von Neumann algebras equipped with normal states is equivalent to 
studying probability spaces. This motivates the definition of a noncommutative or quantum pro- 
bability space as a von Neumann algebra M equipped with a normal state P. The events in this 
theory are the projections in A/", and the state plays the role of a probability measure. We can now 
see how the various technical properties of von Neumann algebras contribute to their interpretation 
as probabilistic models. Weak closure guarantees that the theory is completely determined by the 
set of all events; and normality of the state is equivalent to countable additivity [iiJij. 

Any physical experiment can be described by classical probability, provided that we consistently 
perform the same measurements. For example, if we measure the position of a particle at time i, 
and repeat this experiment many times with the same initial conditions, then the statistics of the 
measurement outcomes are entirely described by a classical probabilistic model. The break with 
classical probability occurs because in quantum mechanics there exist observables, such as position 
and momentum, that cannot be measured simultaneously in the same experimental realization. 
Such measurements are said to be incompatible, and this is enforced by the fact that they do not 
commute. The spectral theorem provides a concrete mathematical implementation of these ideas: 
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a set of compatible events generates a commutative algebra, and is hence equivalent to a classical 
probabilistic model. Thus if E and F are compatible events we can interpret EF as the event {E 
and F) and E\/ F := E + F — EF as the event {E or F). However, two incompatible events are 
not represented on the same classical probability space, and their joint probability is undefined. 

Having considered the noncommutative analogue of a probability space, let us now turn our at- 
tention to random variables. Let us first investigate a classical random variable f : ^l ^ M.. If / is 
bounded, we could interpret it as an element of the algebra A/" = L°°{il, S, P). In general, however, 
a random variable need not be bounded. Nonetheless we can still consider / as a (self-adjoint) 
operator acting on — L^(ri, E,P) by pointwisc multiplication, provided that we restrict its 
domain to the set of -0 e Jff such that ftp £ M' . To make the algebraic description self-contained, 
we must now express the fact that / is measurable in terms of the algebra M . This can be done 
by requiring that for every Borel set _B C M the indicator function X{ui^Q.:f(ui)^B}i considered as 
an operator on by pointwise multiplication, is an element of M . 

A random variable on a quantum probability space, called an observable, is a sclf-adjoint operator 
on J(f. If an observable F is an element of Af (and hence is bounded), then it generates a 
commutative von Neumann algebra and we can use the spectral theorem to map it to a random 
variable i.(F) on a classical probability space. In general, however, F need not be bounded; we 
must extend the map t to be able to represent unbounded observables as classical random variables. 
To this end, note that since F is self-adjoint its spectrum is real. Therefore we can define two 
bounded commuting operators := (F + and T_ ;= (F — il)^^. The operators and 

T_ generate a commutative von Neumann algebra, and can hence be represented on a classical 
probability space by the spectral theorem. We now extend l as 

._ (4r+))'' + (4T-))-^ 

i-iF) ^ . 

Then l{F) is the representation of F as a classical random variable. Now define the spectral measure 

E{B) = t~^{X{u!enu(F){aj)eB}) 

for any Borel set B of R. E{B) is the event in Af corresponding to {F takes a value in B). F is 

affiliated to a von Neumann algebra C if for all Borel sets B we have E{B) S C. This concept is 
equivalent to measurability of a classical random variable. Finally, note that if the observable F is 
an element of the algebra, then the expectation of F is given by P(F). This is consistent with the 
interepretation of F as a classical random variable i{F), as ¥{F) = Ep[i(F)]. Using the extension 
of i we can extend the state P to unbounded observables by 

P(F) =Ep[i(F)] = [ XF{E{dX)) 
Js. 

Then F{F) is the expectation of the unbounded observable F. 

Definition 2.2: (Conditional expectation) Let {A,P) be a quantum probability space and let 
C C Z = {Z e A; AZ = ZA, VAe A}, the center of A. Then P(-|C) : A ^ C is {a version of) 
the conditional expectation from A onto C if for all AG A, we have F{¥{A\C)C) = F{AC) VC e C. 



The center Z is by definition a commutative algebra. In applications, we begin with a fixed 
probability space {J\f, P) and specify a commutative von Neumann subalgebra Z c M generated 
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by the observations we have performed. We then choose A = Z' ^ the commutant of Z in M . By the 
double commutant theorem, Z is the center of Z' . Def. 12.21 onlv aUows conditioning of operators 
that commute with aU the elements of the commutative algebra Z. This is a natural requirement, 
as there is no need for updating observables that are incompatible with what has already been 
observed. Note that t(P(A|C)) = Ep [t(A)|cr(t(C); C G C)] for any self-adjoint A ^ A, where l is 
given by the spectral theorem applied to the commutative von Neumann algebra generated by A 
and C. Hence for observables the conditional expectation coincides with its classical counterpart. 

Remark. We have taken a more restrictive definition of the conditional expectation than is usual 
in quantum probability, i.e. we do not allow for conditioning on noncommutative algebras. The 
more general definition |5U) does not have a direct physical interpretation but is required for the 
definition of concepts such as a noncommutative Markov process j40j . Beside its direct physical 
meaning, one advantage of our more restrictive approach is that a conditional expectation from A 
onto C in the above sense always exists (see TS' for a "construction" using the central decomposition 
of A) and is unique with probability one. In fact, our conditional expectation is a special case of 
the conditional expectation of |50j . 

Lemma 2.3: Tiie conditional expectation of Def. \2.2\ exists and is unique with probability one {i.e., 
any two versions P and Q ofP{A\C) satisfy \\P - Q\\p = 0, where ||X||^ := ¥{X*X).) Moreover, 
¥{A\C) is the least mean square estimate of A given C, i.e. \\A - ¥{A\C)\\p < \\A - C\\r \/C eC. 

Proof. 1. Existence. We have already established that for self-adjoint A E A, we can explicitly 
define a P(A|C) that satisfies the conditions of Def. 12.21 using the spectral theorem, i.e. 
P(A|C) — i~^(Ep[t(A)|cT(t(C); C e C)]). The classical conditional expectation exists, and 
moreover the conditional expectation of a bounded random variable is bounded. Hence 
P(A|C) exists in C for self-adjoint A E A. But any A G A can be written a,s A — Ai + iA2 
with self-adjoint Ai = {A + A*)/2 and A2 = i{A*-A)/2. As P{Ai\C) andP(A2|C) exist and 
¥{A\C) = F{Ai\C) + i¥{A2\C) satisfies the conditions of Def.ESl existence is proved. 

2. Uniqueness w.p. one. Define the pre-inner product {X,Y) := P{X*Y) on A (it might have 
nontrivial kernel if P is not faithful.) Then (C,A- V[A\C]) = P(C*A) - F{C*F[A\C]) = 
for aU C e C and A £ A, i.e. A - P[A\C] is orthogonal to C. Now let P and Q be two 
versions of F[A\C]. It follows that (C, P - Q) = for aU C e C. But P - Q e C, so 
{P-Q,P~Q) = ||P-Q||2 =0. 

3. Least squares. Note that for all K £ C 
\\A-K\\l=\\A-¥[A\C]+nA\C]-K\\l=\\A-F[A\C]\\l+\\P[A\C]-K\\l>^^^ 
where, in the next to last step, we used that A — P[A\C] is orthogonal to P[j4|C] — K E C. 

□ 

Remark. The usual elementary properties of classical conditional expectations and their proofs 
|54j carry over directly to the noncommutative situation. In particular, we have linearity, positivity, 
invariance of the state P(P(A|C)) = P{A), invariance of C (P(A|C) = AHA eC), the tower property 
P{F{A\B)\C) ^ F{A\C) ifC C B, the module property ¥{AB\C) = BP(A|C) for B e C, etc. As 
an example, let us prove linearity. It suffices to show that Z = aP{A\C) + f3P{B\C) satisfies the 
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definition of ¥[aA + I3B\C), i.e. f{ZC) = f{[aA + [3B)C) for all C G C. But this is immediate 
from the linearity of P and Definition 12.21 

We reemphasize that the conditional expectation P[A|C] is a least squares estimate of A. Hence the 
quantum filtering problem is essentially an estimation problem, just like its classical counterpart. 
We can extend the conditional expectation, as we did for the state P, to be defined for any 
self-adjoint operator that is affiliated to A. This way the least squares estimate given a set of 
observations is defined for any quantum observable that is compatible with these observations. 



3 Stochastic processes on Fock space 



After having briefly discussed the framework of noncommutative probability theory, we now turn 
to one particular quantum probability space. Within this model we will discover many interest- 
ing classical stochastic processes, i.e. a whole family of Wiener processes and Poisson processes. 
However, these processes do not commute amongst each other. An extension of Ito's stochastic 
calculus, due to Hudson and Parthasarathy 35 , unites all these processes again in one noncom- 
mutative stochastic calculus (see Section 0J. In Sectional we shall argue that the model studied 
here appropriately describes the quantised electromagnetic field and its interaction with matter. 

Let M' be a Hilbert space. The symmetric or Bosonic Fock space over Jif is defined as 

oo 
n=l 

We will take to be L^(M), the space of quadratically integrable functions on R, and denote 
JF(L^(R)) simply by JF. The Fock space is closely related to the Wiener chaos expansion in 
probability theory (461 ITS] and from a physics point of view it describes a field of bosonic particles, 
like photons. Then the term L^(R)®=" in the direct sum defining JF, describes the situation where 
there are n photons present. Since photons are bosons they have to be described by symmetric 
wavefunctions, explaining the symmetric tensor product in the definition. 

For every / e Jif we define the exponential vector e(/) G .F by 

oo 

e(/):=l©0^/®". (1) 

n—l ^ 

The linear span V of all exponential vectors is a dense subspace of J-. On the dense domain V we 
define for all / g an operator W{f) by 

W^(/)e(.g) :=exp(-(/,5>-i||/p)e(/ + g), 5 g JT. (2) 

These operators are isometric and therefore uniquely extend to unitary operators, also denoted 
W{f), on The operators W{f) : T ^ T are called Weyl operators and they satisfy the 
following Weyl relations 

2. iy(/)W^(g)=exp(-zIm(/,5))VF(/ + g), /,<7G^. 
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It can be shown 07] that the Weyl operators W{f) (/ G ^) generate the von Neumann algebra 
of aU bounded operators on J^, i.e. vN(VF(/); f e Jf) = B{T) =: W. It follows from the Weyl 
relations that for all / € the family {W{tfy\tesL forms a one-parameter group of unitary 
operators on T. It can be shown to be continuous in the strong operator topology 07]. The 
following theorem is a classic result in spectral theory and can be found for instance in I38| . 

Theorem 3.1: (Stone's theorem) Let {Ut}t£R be a group of unitary operators in some von 
Neumann algebra A, continuous in the strong operator topology. There exists a unique self-adjoint 
operator A affiliated to A such that 

Ut = exp{itA) := [ e'*^E{dX), 

where E denotes the spectral measure of the self-adjoint operator A. 



Since {W(tf)}t^]^ is a strongly continuous one-parameter group of unitaries, Stone's theorem 
provides a self-adjoint operator B{f) such that 

W{tf) = exp {itB{f)). 

The operators B{f) (/ e J^) are called field operators. The domain of the operator B{fk) . . .B{fi) 
contains V for every /i, . . . /fc € ^ and k €N (cf. gH])- For f,g € Jif and t G R it follows from 
the Weyl relations that on the domain V 

1. B{tf)=tBU). 

2. B{f + g)^B{f) + B{g), (4) 

3. [B{f),B{g)]^2ilm{f,g). 

The last relation l@]3) is called the canonical commutation relation. We have introduced it via the 
Weyl operators since they have the advantage of being bounded. 

We fix an a in [0, tt) and denote by Ha the subspace of L^(K) of functions of the form e*"/ with 
/ a real valued function in L^(E). For / G Ha we define, as before, bounded operators T{f)+ := 
{B{f) -\- il)~^ and T{f)^ := {B{f) — il)^^ . It follows from the canonical commutation relation 
that the family {T{f)+,T{f)-; f G Ha} is commutative and therefore generates a commutative 
von Neumann algebra C^. We denote by 4> the vacuum state on W = B{T), given by 4>{X) := 
(<I>,X$) where $ := e(0) = 1 © © © ... G is the vacuum vector. Using Theorem ITTI we 
obtain a probability space {fla, 5^a, Pa) and a *-isomorphism i : Cq — > L°°(fla, Sq, Pq) such that 
4>{C) = Ep^ ['-(C')] for all C G Ca- In a similar fashion as before we can now define t{B{f)) for the 
(possibly unbounded) self-adjoint operator B{f). 

Let us study the random variable L{B{f)) on the probability space (Jla,'^a,'Pa) in some more 
detail by examining its characteristic function 



exp 



(zxt(i?(/)))] = 0( exp {iBixf))^ = W{xf)'i>) = exp ( - ^Ml.^ , g R. (5) 



In the last step we used the definition of the Weyl operator, equation and the relation 
(e(/),e(g)) — exp(/, g), which easily follows from equation Define a random process on 
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(r^cEajPa) by 

:=.(B(e*"XM)), t>0, 
where X[o,t] denotes the indicator function of the interval [0, i]. From the characteristic functional 



/ 



exp ^it(_B(/))^ (/ e Ha) given by equation it follows that W" is a process with 

independent, normally distributed increments — W" (s < t) such that their means are zero 
and their variances are t — s. Summarizing, Wj" is a Wiener process on {fla, Sq,, Pq). 



Identifying the operators B{f) and the random variables L{B{f)), we see that the algebra W of all 
bounded operators on the Fock space J-' contains a whole family (indexed by a G [0, tt)) of Wiener 
processes. Note however, that these processes for different values of a, do not commute. For exam- 
ple, it follows from the cannonical commutation relation (0l3) that [W^ ,Wf^'^] — 2imin{s,t} 0. 
Therefore, for different values of a, these processes can not be represented simultaneously on the 
same probability space via the spectral theorem. It is precisely in this sense that noncommutative 
probability is richer than classical probability. 

The idea to simultaneously diagonalise the fields in the family {i?(e'"x[o,t]); ^ > 0} is implicit is 
some of the earliest work in quantum field theory. However, Segal in the 1950s was the first 
to emphasise the connection with probability theory. Apart from the Wiener processes W^, the 
algebra W contains Poisson processes. Before introducing them, some further preparations have 
to be made. 

The second quantisation of an operator A e B{Jf) is the operator r(^) G W = B{J-) defined by 

oo 

r{A) :=/e0^®". 

n=l 

For all A,B e B{,M') this immediately gives T{AB) = T{A)T{B). Let S" be a self-adjoint element in 
B{J^), then exp(itS') is a one-parameter group of unitaries in B{J^). After second quantisation, 
this leads to a one-parameter group r( exp (its')) of unitaries in W (continuous in the strong 
operator topology). Stone's theorem 13 . II then asserts the existense of a self-adjoint operator A{S) 
on such that for alH £ M 

r(exp(it5)) = exp(itA(S')). 

The domain of a product A(S'i) . . . A(S'„) contains V for all self-adjoint elements Si,...,Sn in 
B{J^) |47j. Denote by Pt the projection ^^(R) — + L'^{M.) : f i-^ X[o,t]f where X[o,t] denotes the 
indicator function of the interval [0,t]. For notational convenience, we abbreviate the operator 
A(Pt) to A(t). On the nth-layer of the symmetric Fock space r(exp(isPf)) acts as exp(isPt)®". 
Differentiation with respect to s shows that on the nth-layer of the symmetric Fock space A{t) = 
Pt <g) J®"-! + / (g, (g, /'»n-2 + _ , + /®n-i g, xhis shows that A{t) is the operator that counts 
how many particles, i.e. photons in the field, are present in the interval [0,t]. In particular we 
therefore have A{t)^ = 0, a property that we will exploit later on. 

Since the family of projections {Pt; t > 0} is commutative, it generates a commutative von 
Neumann algebra J\f. For all self-adjoint elements S and T in A/" we have on the dense domain 2? 

pitA(T)pitA(S)p-itA(T)„-itAiS) _ t y ( itT itS -itT -itS\ _ j 

[m, A(T)] = \. ' fs,- ? ^— - »■ 
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For all self-adjoint S £ Af we can define commuting bounded operators T{S)+ :~ (A(S') + 

and T{S)- := (A(5) — il)^^ . The above ensures that they generate a commutative von Neumann 

subalgebra C of W. For / e iy^(M) we define a coherent vector 4'if) by 

^if) := = W-(/)e(0) = exp ( - ^)e(/). 

A coherent state p on W is defined by p{A) :— {ip{f), Aip{f)) . The spectral theorem 12.11 provides 
a classical probability space (fi, E, P) and a *-isomorphism t : C — > L°°(r^, S, P) such that p{C) = 
Ep[i(C)]. For all self-adjoint operators S in the von Neumann algebra M, we can define as before 
random variables t(A(S')) on (r2,S,P). 

Let us investigate the characteristic function of the random variable t(A(S')) in some more detail 



t(A(S)) 



= (V(/),e"''(^V(/)) = (^(/),r(e"^)^(/) 
= e-l/ll^(e(/),e(e-^/))=e(/'('^"^-^)^). 



The functional S i 



(a(s)) 



shows that the process Nt := t(A(t)) is a Poisson process [23 

on (fi, E,P) with intensity measure j/pdA, where A stands for the Lebesgue measure. Sum- 
marizing, when photons are counted in a laser beam they arrive Poisson distributed. Since 
^(/) = W^(/)e(0) = we could just as well have studied the commutative von Neumann 

algebra W{f)*CW{f), equipped with the vacuum state (j). In this way we get a whole family 
of Poisson processes indexed by / within the quantum probability space (>V,(/)). Again, these 
processes do not commute amongst each other jH5| . 

In this section we have seen that the quantum probability space (W, cj)) contains many interesting 
classical stochastic processes. However, these classical processes do not commute amongst each 
other. We will proceed by discussing the stochastic calculus of Hudson and Parthasarathy PS] 
enabling us to treat the stochastic analysis of all these processes in one framework. 



4 Quantum stochastic calculus 



We start this section with some technical definitions and manipulations to clear the way for defining 
the stochastic integrals of Hudson and Parthasarathy. However, it is not so much the definition of 
the stochastic integrals that is of the greatest importance here. It is the subsequent Ito rule obeyed 
by these stochastic integrals, summarized in their Ito table, that will enable us to put them to good 
use. The Ito rule translates the difficult analysis involved in defining the stochastic integrals into 
simple algebraic manipulations with increments. That is the real strength of having a stochastic 
calculus. We refer to 1851 1151 EHI 14 7j for more extensive treatments of quantum stochastic calculus. 

Let and be Hilbert spaces. The symmetric Fock space has the following exponential property 

(cf. 113) 

® ^2) = ^(^1) ® ^(^2), 

in the sense that there exists a unitary operator U : JF(^ © ,^2) ^ •?^(-^) ® ^{■^2) such 
that for all /i G and /2 G the exponential vector e(/i /2) is mapped to the tensor 
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product e(/i) (g) e(/2). For s < t and / G L'^{R) we write /ij := X(-oo,t]/, /[t := X[t,oo)/ and 
f[s,t] ■= X[s,t]/- Furthermore we have L^(IR) = L^((— oo,t]) © L'^{Jt, oo)), which means that every 
/ in L^(R) can be uniquely written as a sum / = /t] +/[t of elements in oo, i]) and L'^{[t, oo)) . 

Writing Tt] J^(L2((-oo,t])), T{L'^{[t,oo))) and J^f^^tj := T{L'^{[s,t])) , this leads to the 

splitting = Tt\ ® Fyt where, as before, exponential vectors of direct sums are identified with 
tensor products of exponential vectors. Since t can vary continuously through M, T is said to be 
a continuous tensor product. The algebra of all bounded operators on J-' splits in a similar way, 
that is W = B{T) = B{Tt]) ® B{T[t) = W*] (g> W[t (cf. 'Tf ), where we denote Wt] B{Tt]), 
Wit BiT[t) and W^s^t] 

On the domain V we introduce annihilation operators At and creation operators by 

^(S(ix[o,t]) -*S(X[o.t])) and A* i(B(ix[o,t]) + »-B(x[o,t])). 

It can be shown 07j that Ate(/) — {x[o,t], f)e{f) and that is its adjoint on V. This means 
that these operators are the annihilation and creation operators for the mode X[o.t] of the field 
as they are known in physics. Note that At^ = Ate{0) = 0, a property that we will exploit 
in future. We will denote by A( the restriction of A{t) to the domain V. It can be shown 07] 
that {e{g) , Ate{f)) — (5, X[o,t]/) (els); This implies the important relation At<i> = that we 

already encountered in the previous section. 

Let Mt be one of the processes At,A^ or A*. The following /aciorisaiion property I47j underlies 
the definition of the stochastic integral 

{Mt - Ms)e{f) = e(/,])((Mt - Af,)e(/[,,,]))e(/[0, s < t, 

such that {Mt — Ms)e{f[s^t]) G ^[s.t]- We have made our notation lighter by omitting the tensor 
product signs. Let H he a, Hilbert space, called the initial space. We tensor the initial space to 
T and extend the operators At, A^ and At to 7i (8) by ampliation, i.e. by tensoring the identity 
to them on Ti. (however, to keep notation light we will not denote it). Just for mathematical 
convenience we will take Ti to be finite dimensional, i.e. H — C". We denote the algebra of all 
operators on H by B. We are ready for the definition of the stochastic integral. 

Definition 4.1: (Quantum stochastic integral) Let {Ls}o<s<t be an adapted (i.e. Lg £ B^Ws] 
for all < s < t) simple process with respect to the partition {sq = 0, si, . . . , Sp — t} in the sense 
that Ls — whenever Sj < s < sj+i. The stochastic integral of L with respect to M on C" (g) I? 
is then defined as [HKllTTj 

/ LgdM, xe{f) := ^ (^L,^xe{f,^])) ((M.^.^, - M.,)e(/[.„.,+,]))e(/[,^.^J, x € C". 

The notation is simplified by writing dXt = LtdMt for Xt = Xq + LgdMs. The definition of the 
stochastic integral can be extended to a large class of stochastically integrable processes |H5I ET] if 
we approximate these by simple functions and take a limit in the strong operator topology. 



Since At$ = At$ = it is immediate from the definition that quantum stochastic integrals with 
respect to At and At acting on $ are zero, or infinitesimally dAt^[t — dAt^[t = 0. From this we 
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can immediately conclude that vacuum expectations of stochastic integrals with respect to At and 
At vanish. Furthermore, we have LgdA^^^ ~ {la L^dAs^, $) = 0, i.e. vacuum expectation 

of stochastic integrals with respect to A^ are zero as well. Note, however, that dA^^^^ ^ 0. 

To get some more feeling for the definition of the quantum stochastic integral we will now inves- 
tigate which quantum stochastic differential equation is satisfied by the Weyl operators W{ft]) 
(/ £ L'^{M.)). Note that the stochastic integrals are defined on the domain V. Therefore we 
calculate for g and h in L^(K) 

(f>{t) (e(5),Ty(/t])e(/i)) =e-</'^^>-5ll/'ill'(e(g),e(/i + /t])) =e<3^-^'i>-<-^'^''>-^ll-^*!ll'e<9'''>, 
which means that 

0(0 - m = £ (e(5), ^ ((ff, /.]) - - IWfs] IP) Hs)eih))ds. 

Let us turn to the definition of the stochastic integral, Definition 14. II Let {Q ^ sq, si, . . . , Sp — t} 
be a partition of [0, t] and choose Lg = f{sj)W{fs^]) for Sj < s < sj+i. Let further Mt be At, then 
the definition of the stochastic integral gives (heuristically in the last step) 

p-i 
i=o 

p—1 p—1 

E (/(^.)X[a,,.,,,],/i)m4])eW = E - {f{s,)Xs,],h))w{f,^])eih) 

0=0 3=0 




' d{L],h)W{fs])e{h). 



Together with a similar calculation for Mt — Af, this yields the following quantum stochastic 
differential equation for the Weyl operator W{ft]) 

dWift]) - {f{t)dA;^7{t)dAt - ^\fit)fdt}w{ft]). (6) 



Let us return to developing the theory further. To be able to compute with the stochastic integral, 
we need a stochastic calculus, i.e. a quantum Ito rule. The basic ingredient for its proof comes from 
the commutation relations between At, At and A^. For instance, from the canonical commutation 
relation and the definition of At and A^ we see [At, ] = i on the exponential domain V. Therefore 
we have 

(e(/), AtA*te{g)) = {Ate{f), Aeig)) + t{e{f), e{g)) = ((/, X[o,t])(x[o,t] .9)+t) e{g)). 

Infinitesimally this immediately leads to d{AlAt) — A^dAt + AtdAl + dt. The next theorem is 
built on similar ideas, it can be found in |35j and (47) . 

Theorem 4.2: (Quantum ltd rule j35y ) Let Xt and Yt be stochastic integrals of the form 

dXt = BtdAt + CtdAt + DtdA^ + Etdt, 
dYt - FtdAt + GtdAt + HtdA*t + hdt, 
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for some stochastically integrable processes Bt,Ct, Dt, Et, Ft,Gt, Ht and It (see ^51 147f ). The 
process XtYt then satisfies the relation 

d{XtYt) = Xt dYt + (dXt) Yt + dXt dYt, 

on the domain V, where dXtdYt should be evaluated according to the quantum ltd table 





dAt 


dAt 


dA; 


dt 


dAt 





dAt 


dt 





dAt 





dAt 


dA*t 





dA*t 














dt 















i.e. dXtdYt = BtFtdAt + CtFtdAt + BtHtdAl + CtHtdt. 



In the previous section we encountered the classical Wiener processes B{e^°'x[o,t]) = ^°'At — 
ie^^Af for a S [0,7r). Since dB(^e^"xio,t]) = ie~^°'dAt — ie^^dA^, we recover the classical Ito rule 
for these Wiener processes, i.e. ((i_B(e*"x[o,t]))^ — dt, from the quantum Ito rule, as it should. 
For an / e L^{R) we can write the Weyl operator W{ft]) as W{ft]) = exp ( /(s)d(A* - As)). 
Therefore it follows from the quantum Ito rule that the Weyl operators W{ft]) satisfy equation 
©, where ~^\\f\\'^W{ft])dt is the Ito correction term. The Poisson process of the previous section 
was given by A{ := W{f)*AtW{f) in the vacuum state, for which the quantum Ito rule gives 

dA{ = dAt+7{t)dAt + f{t)dA; + \f{t)\^dt, 

which leads to the classical Ito rule {dA{)'^ — dA( for the Poisson process. Furthermore, integrating 
the above equation, we see that A{ = At + B{ift]) + \f{s)\'^ds. 

In the next section, we will show that quantum stochastic differential equations (QSDEs) of the 
following form 

dUt = \^LdA; - L*dAt-^L*Ldt-iHdt^Ut, Uq^I, (7) 

where L and H are in B and H is self-adjoint, emerge naturally from physical models. A standard 
Picard iteration argument [SSI 07] ensures existence and uniqueness of the solution. The adjoint 

Ut satisfies 

dUt = U;\^L*dAt - LdAl - ^L*Ldt + iiJdij, = I. 

From the quantum Ito rule it now immediately follows that d{UfUt) — 0, which means that the 
solution Ut is unitary for all t. The interpretation is that Ut defines a time evolution or flow 
X i-> UtXUt in W, i.e., an observation of X at time t is described by the observable Ut XUt- 

Remark. Despite the fact that both At + A^ and i{At — Al) are classical noises, Eq. ((TJ is not a 
classical stochastic differential equation. To make this idea explicit, rewrite the equation as 

dUt = \iL+i{dAt-dAl) + iL^{dAt+dAl)-]^L*Ldt-iHdt^Ut, Uo = I, (8) 
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where -L+ = (L + L*)/2 and L_ = (i — L*)/{2i). This reveals particularly well that the initial 
system is driven simultaneously by two noncommuting noises. If either — or i_ = 0, we say 
that Ut is essentially commutative 1411. 

Define rt(X) := id <Si (l){Ut X®IUt) for all X ^ B and i > 0. Using the quantum Ito rule and the 
fact that vacuum expectations of stochastic integrals vanish, we find for all X e B 

dTt{X)^ id ®4>{d{U:X<S>IUt)) =Tt{CL,HiX))dt, 

where we have defined the Lindblad generator 

Cl.h{X) = i[H, X] + L*XL - ^{L*LX + XL*L), X e B. 

In quantum probability, this object plays the same role as the infinitesimal generator of a Markov 
diffusion in classical probability theory. Indeed, the semigroup Tt, describing the time evolution 
of the expectation of any system observable, can be written as Tt = exp{tCL,H)- The commutator 
with H describes the unitary evolution of the initial system itself, whereas the residual terms 
describe the irreversible effect that the interaction with the environment has on the initial system. 

A more general case can be treated in a similar fashion by introducing more channels in the field. 
That is, the initial system C" is tensored to k Fock spaces. The QSDE that defines the time 
evolution of the initial system and the field together is given in a general form by 

dUt = {L,dA*{t) + (5y - (5y)dAy (i) - L*S,,dAjit) - [iH + ^L*L,)dt}Uu Ua = I, 

where repeated indices are being summed. The index j on Aj(t) and Aj{t)* labels the annihilator 
and creator on the jth copy of the Fock space, S" is a unitary operator on C"(g)^^({l, 2, . . . , k}) such 
that Sij = {i,Sj), and Ay(i) :— A{Pt (g) \i){j\)- Physically, the term {Sij — Sij)dAij{t) describes 
direct scattering between channels i and j in the field j2j- For this equation we again have existence, 
uniqueness and unitarity of its solution ^7\. Moreover, we obtain the associated generator 

k 

C{X) = i[H,X] + Y,L*XL, - -{L*L„X}, X £ B, (9) 

where {X, Y} stands for the anti-commutator XY + YX. It was shown by Lindblad 1^ that any 
semigroup of completely positive identity preserving operators on B = Af„(C) has a generator of 
the form Eq. ®. 



5 The filtering problem in quantum optics 

Beside the intrinsic interest of quantum probability as a mathematical generalization of classical 
probability theory, many realistic physical scenarios are very well described by QSDEs of the form 
we have discussed. The inherent stochasticity of quantum mechanical problems makes this field 
a rich playground for the application of the classical theories of statistical inference and control 
of stochastic processes. Of course, as in the classical theory, white noise systems are only an 
idealization of physical interactions; a Markov limit of wide-band noise in the spirit of Wong and 
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Zakai gives stochastic models in the Ito form. For a large class of quantum systems, particularly 
those arising in the field of quantum optics, such approximations are extremely good and describe 
laboratory experiments essentially to experimental precision. Though a detailed discussion of the 
physics involved in the modelling of such systems is beyond the scope of these notes, we here briefly 
describe the physical origin of the equations that are widely used in the physics community |26| . 

The basic model of quantum optics consists of some fixed physical system, e.g. a collection of atoms, 
in interaction with the electromagnetic field. The atomic observables are self-adjoint operators in 
a Hilbert space H. The description of the electromagnetic field and its interaction with the atoms 
follows from basic physical arguments (e.g. quantization of Maxwell's equations; see the excellent 
monograph '2(Jj for a thorough treatment of this field, known as quantum electrodynamics.) It 
turns out that the free electromagnetic field, i.e. an optical field in empty space, is described by a 
stationary Gaussian (noncommutative) wide band noise d{t,r) that propagates through space at 
the speed of light c; i.e. if we restrict ourselves to a single spatial dimension, a(<-|-r, z) = a{t, z—ct). 
If we now place the atoms at the origin z = 0, then the quantum dynamics is given by a Schrodinger 
equation of the form 

j^U{t) ^~\[H~ ih{L{t) + L*ma{m ~ ~a(tM\ U(t), 

where L(t) is an atomic (dipole) operator and H is an atomic Hamiltonian. This equation, which 
follows directly from the physical model, is similar to but has wide-band right hand side. One 
generally assumes that the dipole operator is harmonic, L{t) = Le~*"*, but this assumption can 
be relaxed |3- Hence L and H are operators in the initial system B, H being self-adjoint. 

We now wish to approximate this equation by an equation where the noise has infinite bandwidth. 
There is a large body of literature on such approximations in classical probability, following the 
pioneering work of Wong and Zakai (see e.g. 021 and references therein.) A common way to attain 
this limit is to rescale the (wide-band) noise term by 1/e, time as t/e^, and then take the limit 
£ ^ 0. The effect of this rescaling 031 is that the noise bandwidth goes to infinity, whereas its 
energy per unit bandwidth is retained. A similar intuition holds in the noncommutative case; i.e., 
one can show that for the type of model we have described 

1 /■* 

- / L(s/e^)(a*(s/£^,0) - a(s/e^,0))ds — > ^ LA^ as e — >0 (in distribution), 
£ JO 

where 7 > depends on the characteristics of the wide-band noise d and on the dipole rotation 
frequency w |2E1E2- 

The essential question is now the limiting behavior as e — > of the rescaled Schrodinger equation 
^U'{t)=-^ [H -ih{L{t/e^)+L*{t/e^)) {dit/e^,0) ~ d*{t/e^,0)) /e] U'{t). (10) 

The limit e ^ of this equation is known as the weak coupling limit, or the Friedrichs-Van Hove 
limit, in the quantum probability literature. The limit can be studied by expanding the solution of 
Eq. H1U|I in a (Dyson) series using Picard iteration, and then calculating the limit of each term in the 
series [2|31|321E]- In particular, as we are interested in the time evolution U (t)* X {t)U {t) of any 
observable X, one can study the convergence of each term in the series expansion of l}{ty*XU'^{t). 
One finds that U{tY*XU^t) U^XUt as e ^ 0, where 

dUt = I V7 - y/lL*dAt - i(7 + i(T)L*Ldt - iHdt^Ut, Uq = /, 
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with (T G M. The additional term cx L*L is the Ito correction term obtained in the hmit. It is a 
signature of the noncommutativity of the wide band noise that the constant muhiplying the Ito 
correction is complex; the correction consists of a diffusive term ^L*L/2 and a Hamiltonian term 
iaL*L/2. In the following we will absorb the constant 7 into the definition of L, i.e. ^J^h 1— > i, 
and the Hamiltonian correction into the definition of iJ, i.e. iJ + <7L*L/2 H . This gives 

dUt^ ^LdAl - L*dAt-]^L*Ldt-iHdt^Ut, Uo = I, (11) 

which is a QSDE in the Hudson-Parthasarathy form like we encountered previously. Henceforth 
we will take this equation as our physical model. We note that in some situations it is also possible 
to obtain gauge (dAt) processes in the weak coupling limit this is particularly useful for the 
description of a strong off-resonant laser probe, which is essentially a scattering interaction. 

In classical models, the system dynamics is usually described by an SDE which determines the 
time evolution of the state of the system. A classical "system observable" , then, is any function 
f{xt) of the system state Xt- The flow jt of the dynamics then returns the random variable 
jt{f) = f{xt) corresponding to the observable / at time t. Due to the fact that quantum models 
are noncommutative we cannot give a sample path interpretation to the full model; in particular, 
there is no analogue of the system state Xt in quantum mechanics. On the other hand, we do have 
a well-defined concept of a system observable, which is just any observable of the initial system. 
The unitary solution Ut of Eq. Hll|l then provides a quantum analogue of the flow jt, given by 
jt{X) = U^XUt- This description of a stochastic model by a fixed probabihty measure (state) and 
time- varying random variables (observables), which is nearly universal in classical stochastics, is 
known as the Heisenberg picture in quantum mechanics. It is not difficult, using the Ito rules, to 
find a dynamical equation for jt 

djtiX)=jt{CL.HiX))dt + jti[L\X])dAt+jt{[X,L])dAl (12) 

This is the quantum analogue of the classical Ito formula for jt{f)- 

Having described the dynamics of the system and its interaction with the field, let us now turn to 
the observations that we can perform. Unlike in classical stochastic theory, where one observes some 
observable of the system, in quantum models an observation is generally performed in the field. 
From the system's perspective, the interaction with the field looks like an (albeit noncommutative) 
noisy driving force. Similarly, however, the field is perturbed by its interaction with the atoms, and 
carries off information as it propagates away after the interaction. By performing a measurement 
in the field, then, we can attempt to perfom statistical inference of the atomic observables. 

To calculate the perturbation of the field by the atoms we once again calculate U^YUt, where 
now, however, F is a field observable. The field observable of interest depends on the type of 
measurement we choose to perform. Without entering into the details, we mention two types 
of measurement that are extremely common in quantum optics: direct photodetection (photon 
counting), for which the observation at time t is given by = UtAtUt, and homodyne detection, 
for which = Ut*{At + A;)Ut (more generally Y^^ = U;{e-"^At + e^^A*t)Ut.) We refer to 
for a detailed treatment of quantum optical measurements. Using the Ito rules we obtain 

dYt^ = dAt + ML) dA*t + jt{L*) dAt + jtiL*L) dt, 

dY^ = jt{L + L*)dt + dAt + dA;. 
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Intuitively, it appears that 1^^ is like a Poisson process whose intensity is controlled by jt{L*L), 
whereas is a noisy observation of jt{L + L*). We cannot draw this conclusion, however, as 
jt{L) need not commute with At or A^, nor with itself at different times. 

It is essential, however, that the observation process commutes with itself at different times, and 
is hence equivalent to a classical stochastic process through the spectral theorem. An observation 
process that does not obey this property cannot be observed in a single realization of an experiment 
and is physically meaningless. Let us show that the observations processes we have defined above 
do obey this property, which is called the self-nondemolition property. Let Z be any operator of 
the form 7 (g) Z^j (g) 7 on W (g) J^g] ^ ^[s and let t> s. Then the Ito rules give directly 

u;zut = u:zus+ f u;cL,H{z)UrdT+ f u;[L\z]UrdA^+ f u;[z,L]UrdA;. 

J 3 J s J S 

Now let Z = As + A* ov Z = kg. In both cases jCl,h{Z) = [Z, L] = as L and 77 are system 
observables and Z is a field observable. Hence = uf{As + A*)Ut and Y^ = U^AsUt for all 
t > s. It is now easily verified, using the unitarity of Ut and the fact that Ag + A* and are 
commutative processes, that [Y^ , Y^] = [Y/^, Yj^] = for all t, s. Do note, however, that Y^ and 
Y/^ do not commute with each other; in any experimental realization, we can choose to perform 
only one of these measurements. 

Moving on to the next step in our program, we now wish to use the information gained from the 
measurement process to infer something about the initial system. To find a least mean square 
estimate of a system observable X at time t, given the observations 1* up to this time, we must 
calculate ni{X) = P{jt{X)\yt), where yi = vNiYg : < -s < t) is the (commutative) Von Neumann 
algebra generated by the observations history. The remainder of these notes is devoted to finding 
a recursive equation for 7rt(A) (the filtering equation). As we have discussed previously, however, 
this conditional expectation is only defined if jt{X) is in the commutant of 3^(, the intuitive 
interpretation being that statistical inference of an observable is only physically meaningful if 
the conditional statistics could possibly be tested through a compatible experiment. Through 
an entirely identical procedure as in the previous paragraph, we can show that jt{X) is in the 
commutant of 3^t for any system observable X . This is known as the nondemolition property. 

Remark. Unlike in a classical filtering scenario, we have not added any independent corrupting 
noise to the observations. Nonetheless, the filtering problem docs not trivialize to a problem with 
complete observations because the system is driven by a quadrature of the field that does not 
commute with the observations. Hence the problem of partial observations is intrinsic to quantum 
measurement theory, and does not result only due to technical noise in the detection apparatus. 
Additional technical noise is however straightforward to take into account as well; we will discuss 
this possibility in a later section. 

As a final note, we remark that we have now obtained a system-theoretic model of our system and 
observations. For example, the algebra {B (g) W, p® (j)) together with the pair 

d3t{X) = jt{CL,H{X)) dt + jt{[L*, X]) dAt + jt{[X, L]) dA^ 

dYt = ML + L*) dt + dAt + dA*t 

completely defines a system-observations model, in direct analogy to the system-observation models 
used throughout classical nonlinear filtering and stochastic control theory. 
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6 The reference probability method 



The goal of this section is to derive the quantum filtering equation of Belavkin, a recursive equation 
for TTt{X), using a method that is close to the classical reference probability method of Zakai |56| . 

Let us briefly recall the classical procedure of Zakai. In order to simplify the filtering problem, one 
starts by introducing a new measure, using a Girsanov transformation, under which the observation 
is a Wiener process. Then various (elementary) properties of the conditional expectation allow 
the filtering problem to be expressed, and solved, with respect to the new measure. Below we will 
apply this logic to the quantum filtering problem. 

The following filtering problem is considered in this section. Let Zt = At + and denote by At 
the commutant of Ct = vN(Zs : < s < t). With respect to the state ¥ = p (g) (f) on the algebra 
B ^W, we consider the system-observation pair 

djt{X) = jtiCLMiX)) dt + jti[L*,X]) dAt + jt{[X, L]) dA; 

dYt ^ jt{L + L*)dt + dAt + dA*t 

where jt{X) = UtXUt and Ut is given by Hll|l . We are interested in finding a recursive equation 
for 7Tt{X) = VijtiX)\yt) where 3^* = vN(n : < s < i). 

For future reference we first state two elementary properties of the conditional expectation. Let 
Kt be an adapted process in At- First, we claim that P{Ks\Ct) — P{Ks\Cs). This property follows 
from the fact that Ct = Cs ® C[s,t] and that Kg is independent from C[sj] by adaptedness. Second, 
conditional expectations and integrals can be exchanged as follows: 



These properties are immediate if Kt is a simple process, and a proof of the general case is not 
difficult. Note, however, that these properties would not be as straightforward if we were to 
replace Ct by yt- Unlike in the classical Ito theory, quantum Ito theory is grounded in the explicit 
representation of noise processes on Fock space; a concept such as adaptedness is defined with 
respect to the Fock space, rather than with respect to the integrator process. 

Though representation- free quantum Ito theories have been considered in the literature ITI, we 
choose for simplicity to stick to the Hudson-Parthasarathy theory. In this context, the discussion 
in the previous paragraph suggests that conditioning is most easily performed if we rotate the 
problem in such a way that the observations lie entirely in the Fock space. To this end we use 
another elementary property of conditional expectations: if J7 is a unitary operator and we define a 
new state Q{X) = V{U*XU), then F{U*YU\U*CU) = U*Q{Y\C)U (this can be verified using the 
definition of the conditional expectation.) In our setup, this implies that TTt{X) — UtQ*{X\Ct)Ut 
where we have defined the state Q\X) = ¥{U;XUt). 

Remark. The time-dependent state Q* is precisely the Schrddinger picture state; in this picture 
(used in TS^) the state evolves in time rather than the observables. This approach is ubiquitous 
in quantum mechanics but is less common in classical stochastics. Here we prefer to work in 
the Heisenberg picture; we only consider the state Q* for fixed time t as an intermediate step in 




(13) 
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obtaining the Kallianpur-Striebel formula below, which avoids the technical hurdles due to the 
explicit representation of quantum noises on the Fock space. 

We are now ready to get down to business. Inspired by the classical reference probability method, 
we seek a change of measure (change of state) that reduces the filtering problem to elementary 
manipulations. The following lemma describes how conditional expectations are related under a 
nondemolition change of state. 



Lemma 6.1: Let {JV,F) be a noncommutative probability space with normal state P. Let C be 
a commutative von Neumann subalgebra of Af and let A be its commutant, i.e. A = C := {N G 
Af; NC = CN, yC e C}. Furthermore, let V be an element in A such that F[V*V] = 1. We can 
define a state on A by ^[yl] := P[F*AT/] and we have 

r . , P\V*XV\C] 



Proof. Let K be an element of C. For all X ^ A, we can write 



''[V*XV\C]K 
''\v*VQ[X\C]K 

proving the lemma. 



[V*XKV] = Q[XK] 



[V*V\C]Q[X\C]K 



[X\C]K 



[X\C]K 



□ 



Remark. It is essential that V is an element of the commutant of C. The proof would not have 
worked if V were in Af. The conditional expectation can only be defined from the commutant onto 
C and in the first and fourth step of the proof we explicitly used that V is an clement of A. 

Our next goal is to find a suitable operator V in order to apply Lemma ffi. II to Q'^{X\Ct). Con- 
veniently we have already expressed Q*(X) as F{Uf XUt), and under P we can directly apply the 
result Eq. H13(l . However, Ut is in general not in At, so that Lemma Ffi. II can not be applied with 
V = Ut. This is best seen in equation (jSJ, where aside from Zt the incompatible noise i{At — A^) 
appears as one of the driving terms in the equation for Ut. The problem is resolved by the following 
technique, which to our knowledge first appeared in a paper by Holevo |84| . 

Lemma 6.2: Let Vt be the solution of the QSDE 

dVt = \^L{dA* + dAt) - ^L*Ldt - iHdt^Vt. (14) 

Then Vt G At and Q*(X) = F{Vt*XVt). 



Proof. Suppose without loss of generality that F{X) = {ip ^ ^,X ^p ^ ^) for some vector ip G Ti, 
(and <I> is the vacuum vector); as Ti. is finite-dimensional, we can always write any state p (8) as a 
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convex combination of vector states of this form. Now note that 
d{Uti^ ® $) = {LdA; - L*dAt - ^L*Ldt - iHdt^Uttp «) $ = 

{LUttp (X) $t]) «) - {L*Utip (E)^t])<E {dAt^[t) - ^L*LUttp <E) ^dt - iHUttp (E) ^dt = 

[LUtip ® *t]) ® + [LUti! ® $t]) ® (rfAt$[t) - ^L*LUttlJ E) ^dt - iHUtip (X ^dt, 

since dAt<^[t is 0. It follows that Ut ip ® ^ Vt ip ® <^ , and hence ¥{U;XUt) = V{Vt*XVt) for any 
X E B E) W. Moreover, Vt is an element of At', indeed, equation (|14|l is driven only by a single 
commutative noise At + A^. □ 

Putting these results together, we obtain a noncommutative version of the classical Kallianpur- 
Striebel formula. 

Corollary. (Noncommutative Kallianpur-Striebel) Let at{X) = U^FiV^* XVt\Ct)Ut. Then 

MX) = ^^, yXeB. (15) 

We now obtain an explicit expression for at{X). Using the quantum Ito rules 

V;XVt = X+ f v:CLMiX)Vsds+ f V:iL*X + XL)VsdiAs + A*J. 
Jo Jo 

Using Eq. we obtain 

v[v;xvt\Ct]^¥[x]+ f v[v;cL,Hix)Vs\Cs]ds+ f v[v;{l*x + xL)Vs\Cs]d{A, + a*). 

Jo Jo 
Another application of the quantum Ito rules now yields immediately the Belavkin-Zakai equation 

dat{X)=at{CLM{X))dt + at{L*X + XL)dYt. (16) 

Remark. It follows from the linearity and complete positivity of the conditional expectation that 
at is a linear positive functional on B, i.e. apart from the normalisation it is a (random) state. 
Hence Eq. (|lt)|l is a noncommutative analogue of the Zakai equation in classical filtering theory 
and propagates the unnormalized conditional state at- 

By applying the (quantum) Ito rules to the Kallianpur-Striebel formula, we obtain an expression 
for the normalized conditional state 



d-K 



{X) = TTt{LL.,H{X))dt + (TTt{L*X + XL) - T^t{L* + L)^t{X)) (dYt - ML* + L)dt) . (17) 



This Belavkin-Kushner-Stratonovich quantum filtering equation is a quantum analogue of the clas- 
sical Kushner-Stratonovich equation. 

We conclude this section with an investigation of the innovations process dZt = dYt ~ ""t {L* + L)dt. 

___ ____ 2 

The general theorem below ^| ^| shows that Zt is a martingale; but as dZ^ — dt it must be 
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a Wiener process by Levy's theorem (the latter can be appUed directly if we use the spectral 
theorem to find a classical representation of Zf) Also this property is in complete analogy with 
the innovations process in classical filtering theory. 

Theorem 6.3: Let dZt = atdAt + hdA^ + b^dAt and Zq = where a* e M and bt G C for all 
t > 0. DeEne the innovating martingale Zt by 

Zt ■.= U*ZtUt-(^J^ a,7r,(L*L) + 6,7r«(L*) + 6X(i)rfs). 

Then Zt is a martingale, i.e. for all t > s > we have F[Zt\ys] = Zg. 

Proof. We need to prove that P[Zt — Z^jX] = for alH > s > 0. This means we have to prove 
for allt> s>0 and K ey, that ¥[(Zt - Zs)K] = 0. This is equivalent to 

¥{U;ZtUtK) - F{U;Z,UsK) = V>(arTTr{L*L)K + br1Tr{L*)K + bl'Kr{L)K^ds, 

for ah t > s > and K ^yg. Since e J's it can be written as K = U*CUa = U^CUt for some 
C £ Cs- Furthermore, if C runs through Cs^ K runs through y^. For t = s the above equation is 
true, therefore it remains to show that for all C G Cg 

dF{U*ZtCUt) = V{arTTr{L*LC) + fe^7r^(i*C) + b*TTr{LC))dt 
= F{U*{arL*LC + brL*C + brLC)Ut)dt. 

But this is just an exercise in applying the quantum Ito rules. □ 



7 More examples 

7.1 Controlled quantum diffusions 

Up to this point we have considered filtering only for simple quantum systems interacting with a 
field. However, an important application of quantum filtering theory is in quantum control — we 
can feed back a signal to the initial system based on the observations we have made, in real time, 
in order to achieve some particular control goal. In control theory, the filter is often used as an 
intermediate step in generating the control signal. Before we can do this, however, we must show 
that the filtering equations retain their form even in the presence of feedback. This can be done 
in a natural way in the reference probability approach. 

The basic object we need is a controlled quantum diffusion. As before, we fix the algebra B (g) W. 
The controlled diffusion consists of two things: 

• The output noise Zt (we will choose Zt = At+At). 
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• The controlled Hudson-Parthasarathy QSDE^ 

dUt = jit dA; - l; dAt " ^LlLt dt - iHt dtj Ut 

where for all t, Ht and Lt are affiliated to B ® Ct {Ct = vN{Zs : < s < t)), and Ht is 
self- adjoint. 

As before, we find that this model gives rise to the system-observation pair 

djtiX) = jt{CL„H, {X)) dt + X]) dAt + Jt{[X, Lt]) dA*t 

dYt = jt{Lt + L*t) dt + dAt + dAl 
where it{X) ~ UtXUt is the flow and Yt — Ut ZtUt is the output process. 

To expose the connection with control, consider the simplest case where Lt = L G B is fixed and 
Ht = H ^ ut{Zs<t), i.e. iJ is a fixed Hamiltonian in B and Ut is some real function of the history 
of the output noise up to time t. Then we get 

djt{X) = utiY,<t)jtii[H, X])dt + jt{CLj{X)) dt + jt{[L\X]) dAt + jt{[X, L]) dA*t 

dYt = jt{L + L*)dt + dAt + dA; 

where it is important to note that by pulling it outside jt(-)j Ut becomes a function of the ouput 
process! This is just the model we considered before, with the difference that here the Hamiltonian 
term is modulated by a control signal which is some (arbitrary) function of the observations history. 
This is precisely the situation in (Hamiltonian) feedback control, the idea being that we choose 
the function ut so that a particular control goal is achieved. The general model of a controlled 
quantum diffusion allows us to make both H and L arbitrary functions of the observations history, 
and thus provides a general model for quantum systems with feedback. 

The question we ask here is, what is the form of the filter when we allow for feedback? It turns out 
that nothing much changes in the derivation of the filter using the reference probability method. 
We can follow all the same steps to obtain the Kallianpur-Striebel formula 

TTtiX) = P(jt(X)|vN(n : < s < t)) = at{X) = U:P{Vt* XVt\Ct)Ut 

with the C(-affiliated change of measure 

dVt = jit dZt - ^L;Lt dt - iHt dt^ Vt. 

By applying the quantum Ito rules, we now obtain the Zakai equation 

d(Tt{X) = at (££.,//. {X)) dt + <Jt{L;X + XLt) dYt. 

In particular, suppose that as before Lt — L, Ht = H®ut{Zs<t)- Note that f{Vti[Ht,X]Vt\Ct) = 
ut{Zs<t)P{Vti[H,X]Vt\Ct) as Vt commutes with Zs<t and by the module property of the condi- 
tional expectation. Thus in this case 

d<Jt(X) = Ut{Y,<t) atii[H, X]) dt + cTt{CLj{X)) dt + at{L*X + XL) dYt. 



^For existence and uniqueness, see e.g. 
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We see that as expected, the fiher for a quantum diffusion with feedback; has precisely the same form 
as the filter obtained previously with feedback added in a naive way. This provides a foundation 
for the use of the filter in quantum feedback control. 



7.2 Imperfect observations 

We consider the same filtering problem as in the section IHl except that now we add additional 
corrupting noise to the observation 

dYt = jt{L + L*) dt + dAt + dA; + K{dBt + dBl). 

Here k > and Bt is an additional field, the corrupting noise, which is independent of At] i.e., we 
have tensored another copy of the Fock space !F onto the Hilbert space, and Bt is the fundamental 
process on this space. We now take Zt — At + A^ + K{Bt + Bt); as before Ct — vN(Zs : < s < t) 
and At is its commutant. Note that we still have Yt = UtZtUt, etc. 

Up to and including the Kallianpur-Striebel formula, this case is identical to the one treated in 
sectional As before, we have 

Vt*XVt = X+ [ V:CL,H{X)Vsds+ [ v:iL*X + XL)Vsd{A,+Al). 
Jo Jo 

However, we cannot directly apply Eq. at this point, as in this case the stochastic integral 
with respect to Ag + A* does not correspond to the integral with respect to Zg . Let us write 

At + a; = aZt + [(1 - a){At + A*) - aK{Bt + B;)] , 

which holds for any a G M. Now note that if wc choose the particular value a — (f + k^)^^, 
then the noise Zt is independent from Mt = (1 — a){At + A^) ~ an{Bt + Bt). By an elementary 
argument, it follows that 

Ct] = 0. 







Ks dMs 

This, together with Eq. I|13() . directly gives the Belavkin-Zakai equation with imperfect observations 
datiX) ^ nt{CL.H{X))dt + [l + K^)-^at{L* X + XL)dYt. (18) 



7.3 Photon counting observations 



Once again we consider the same system; instead of homodyne detection, however, we now perform 
photon counting in the field: i.e., Yt — UtAtUt which gives 

dYt = dAt + 3tiL)dA*t + jt{L*)dAt + jt{L*L)dt. 

We would like to follow the same procedure as for homodyne detection. The following lemma, 
which replaces Lemma suggests how to proceed. The proof is identical to that of Lemma 
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Lemma 7.1: Let Uj. be the solution of the Hudson-Parthasarathy equation 

dUi = \^L'dA; - L'*dAt - ^L'*L'dt - iH'dt^U[ 
and let VI he the solution of 

dVl = ^L'{dKt + dA; + dAt + dt) - ^L'*L'dt - L'dt - iH'dt^V^. 
Then Vt G vN(A, + A^ + As + s : < s < t) and P(t//*Xt//) = P(F/*XF/). 

Define Zt — At + A^ + At + t, and as before At is the commutant of Ct — vN{Zs : < s < t) 
and yt = vN(Kj : < s < t). Lemma [7.11 directly provides us with a nondemolition change of 
measure, provided that we rotate our problem so that yt — U't*CtU't using a suitable unitary U'f 
Then, defining at(X) = U't*¥[Vl* XVl\Ct)Ui the KaUianpur-Striebel formula holds for (Jt{X). 

Define Wt as the solution of the QSDE 

dWt = [dAt - dA*t - ^dt]Wt 

(compare with Eq. ©•) Using the quantum Ito rules one can verify that At — WtZtWt- But recall 
that Yt = UtktUt = UtWt ZtWtUt] thus U't = WtUt is our rotation of choice. Another apphcation 
of the quantum Ito rules gives 

dU't = |(i - l)dA*t ~ (L* - \)dAt - ^{L*L + I - 2L + 2iH)dt^U't, 
which corresponds to the nondemolition change of measure 

dVl = |(i - l)dZt - ^{L*L -1 + 2iH)dt^Vt. 
For X Cz B, we obtain using the quantum Ito rules 

dVt'*XV; = v;* {CL.H{X))v;dt + VI*{L*XL - X)Vl{dZt - dt). 
Finally we obtain using the definition of dt, Eq. H13|l . and the quantum Ito rules 

dat{X) = at{LLM{X))dt+{at{L*XL) - nt{X)){dYt ~ dt). (19) 
which is the Belavkin-Zakai equation for counting observations. 

Using the KaUianpur-Striebel formula we can now obtain an expression for the normalized condi- 
tional state 

d^t{X) = T:t{CLMX))dt + f^^^vSr^ - ^*(^)) (^^* - ML*L)dt), 

\ nt[-L L) / 

which is the quantum filtering equation for photon counting. By Theorem 16.31 we see that the 
innovations process dZt = dYt — utiL* L)dt is a martingale. Note that this implies that in terms 
of the conditional state, Zt is a counting process with rate ■nt{L*L). 



24 



7.4 Other filters 



There is a wide array of other relevant examples, which we do not discuss here. For risk-sensitive 
filtering equations and their use in risk-sensitive control we refer to |36l I37| . Moreover, sometimes 
the input noise is not taken to be in the vacuum state, as is the case for squeezed or thermal noise. 
For a general treatment we refer to jl2j , and for the specific case of squeezed noise see ■ 
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